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The Impact of Prior Assumptions on Bayesian Estimates of Inflation Parameters and 
the Expected Gravitational Waves Signal from Inflation 
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There has been much recent discussion, and some confusion, regarding the use of existing obser- 
vational data to estimate the likelihood that next-generation cosmic microwave background (CMB) 
polarization experiments might detect a nonzero tensor signal, possibly associated with inflation. 
We examine this issue in detail here in two different ways: (1) first we explore the effect of choice 
of different parameter priors on the estimation of the tensor-to-scalar ratio r and other parameters 
describing inflation, and (2) we examine the Bayesian complexity in order to determine how effec- 
tively existing data can constrain inflationary parameters. We demonstrate that existing data are 
not strong enough to render full inflationary parameter estimates in a parametrization- and prior- 
independent way and that the predicted tensor signal is particularly sensitive to different priors. For 
parametrizations where the Bayesian complexity is comparable to the number of free parameters 
we flnd that a flat prior on the scale of inflation (which is to be distinguished from a flat prior on 
the tensor-to-scalar ratio) leads us to infer a larger, and in fact slightly nonzero tensor contribution 
at 68% confldence level. However, no detection is claimed. Our results demonstratethat all that is 
statistically relevant at the current time is the (slightly enhanced) upper bound on r, and we stress 
that the data remain consistent with r = 0. 

PACS numbers: 98.80.Cq 



I. INTRODUCTION 

Shortly after its introduction [H H El H El E], in- 
flation was found to produce a nearly flat Gaussian 
spectrum of adiabatic density perturbations that could 
have been the seeds of observed structure in the Uni- 
verse m l Uni HH ini [13] . The simplest model of in- 
flation is that of a slowly rolling scalar field [HI [151 HB] i 
which naturally produces a close to flat primordial spec- 
trum. While the available observations are remarkably 
consistent with such a spectrum, unfortunately one can 
obtain virtually any scalar spectrum by simply adjust- 
ing the shape of the inflaton potential at early times, 
and therefore present results are strongly suggestive, but 
not yet unimpeachable evidence that inflation actually 
occurred. 

There are other more generic predictions of inflation 
that could be subject to testing, however. For example, 
a single rolling scalar field during inflation produces per- 
turbations that are very close to Gaussian. A detection of 
significant primordial non-Gaussianity in the cosmic mi- 
crowave background (CMB) could rule out simple slow- 



roll inflation [T7]. A second possibility is the fact that 
inflation generally produces a spectrum of tensor pertur- 
bations, which could, among other effects, produce an 
observable i3-mode polarization in the CMB [T8l [TQ] , al- 
beit plagued by uncertainties ^201 [5T] . Note that tensor 
perturbations are not the only source of _B-modc polariza- 
tion f22l [23l [Ml l25i , and noninflationary transitions can 
also produce a similar background [2Sll27j. Nevertheless, 
observation of both the scalar spectrum and the tensor 
spectrum could at least test the predictions of slow-roll 
(SR) inflation, through the consistency relation 
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where nx is the tilt of the tensor spectrum, r is the ratio 
of the amplitudes of the tensor and scalar spectra, ax 
is the running of the tensor spectrum and ns is the tilt 
of the scalar spectrum. A tensor spectrum has not been 
detected so far, and many future experiments have been 
proposed to search for a gravitational waves signal from 
inflation [Ml Hi [SHI El E3 [32 [31 [3S IMl [SZ] • 

With only observations that constrain the scalar spec- 
trum, one might hope to gain some information on the 
inflaton potential [351 IMl [ID] • However the plethora of 
different models of inflation make such a task difficult. 
Nevertheless, obtaining any information one can on the 
potential using the observed scalar perturbations could 
give information about the possibility of observing ten- 
sor perturbations. In Ref. ^T] the following relation be- 
tween the change in value of the scalar field cj) and the 
tensor-to-scalar ratio r, holding deep inside the slow-roll 



approximation, was pointed out, 
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where N is the number of e-folds the Universe grows dur- 
ing the change A(f) of the scalar field. That is, when fo- 
cusing on only a small part of the potential, and not nec- 
essarily on the whole duration of inflation, AA^ can cor- 
respond to a number much smaller than the total number 
of e-folds of inflation, A^ '^ 60 — 70. Hence, relation ^ 
relates the flatness of the potential to the relative ampli- 
tude of tensor perturbations. Throughout this work we 
use Gmp = h—c—\. 

As the only current probe of the mechanism of infla- 
tion is the observed spectrum of density perturbations in 
the Universe, Refs. |l2l SSI SH |45l |46l Si] concentrated 
on reconstructing the inflaton potential only in the ob- 
servational range. It was found in Ref. [46 that in the 
observational range naturally Acf) < mp and AiV ^ 22. 
This bound on AiV comes from the condition that the 
smallest observable modes actually freeze in [IT]. The 
bound on A0 can then be understood from Eq. ^ as 
the data prefer models with r smaller than at most 0.4 
(depending on the data used). 

The reconstruction of the inflation potential gave a 
weak upper limit on r, fully consistent with r = 0. More 
recently, however, at least one group has claimed that 
recent data imply a nonzero lower limit on r |48j . 

Obviously it is important to clarify this situation, es- 
pecially when the results would have such great signifl- 
cance, and when a dedicated satellite mission to probe 
for primordial B modes associated with a nonzero tensor 
signal, is being considered. 

In cases such as this, it is useful to take a Bayesian 
approach and to consider how effective the data really are 
at constraining parameters. Thus, one must consider not 
merely a posteriori probability estimations, but also the 
effect of prior assumptions (see [49 for some discussion of 
this issue). If the results depend crucially on the latter, 
then the parameter estimates one derives from the data 
must be taken with a grain of salt. 

The purpose of this paper is to explicitly explore pre- 
cisely this question at the current time, in order to help 
solidify expectations for future measurements of this im- 
portant and fundamental quantity arising from inflation. 
Specifically we first explore to what extent the priors one 
assumes in the analysis affect the expected value of v. 

One might argue that with little knowledge of the rele- 
vant physics, it is perhaps pointless to argue strongly on 
behalf of one set of priors or another at this point. It does 
make sense, however, to examine how robust the conclu- 
sions one draws are, under different prior assumptions. 
(See also work to appear by Vaudrevange and colleagues 
[50l 151].') This work focuses on the effect of taking a 
flat prior on the Hubble factor during inflation, and its 
derivatives with respect to the scalar fleld value 0. We 
will show that a change of parametrization, but not of 



physical model, in this case can lead to significantly dif- 
ferent bounds on parameters, some of which may mildly 
hint at a larger value of r as well. 

In this regard we note that in Ref. jlH] , a lower bound 
on the tensor-to-scalar ratio has been found which one 
might be tempted to ascribe to a choice of prior. An im- 
portant difference between their result and ours however 
is that their lower bound on the tensor-to-scalar ratio 
is caused by a theoretical prior: the models they allow 
can only be consistent with today's observed scalar am- 
plitude and tilt if the tensor-to-scalar ratio is significant. 
In the present work however, the prior on allowed mod- 
els is as broad as possible, a priori not ruling out any 
combination of inflationary parameters. 

Next, in order to explore the general signiflcance of 
any derived lower bound on r based on a choice of priors, 
we examine the Bayesian complexity parameter associ- 
ated with the current data. This gives a very useful tool 
to explore how many free parameters the data can use- 
fully constrain. As we demonstrate, for many inflationary 
parametrizations, the data are currently simply not pow- 
erful enough to add information beyond the prior, for all 
the parameters, explaining the prior-dependence of es- 
timates of r that we have found. Thus, we argue that 
existing data at best provide a rough upper bound on r, 
rather than providing a robust estimate of its posterior 
probability distribution. 

In Sec. HI] we discuss the relation between different flat 
priors, and explain how to translate posterior probability 
densities from one prior to another. In Sec. [ITT] we apply 
a flat prior on the value of the Hubble parameter and its 
derivatives during inflation, flt it to the data, and dis- 
cuss the results for both prior dependence and Bayesian 
complexity. We conclude in Sec.|IV[ 



II. PRIORS AND POSTERIORS 

When faced with the problem of estimating parameters 
from data, Bayesian inference enjoys a great popularity 
among cosmologists (see (55] for a recent review). An 
essential ingredient of any Bayesian inference is the prior 
distribution, which encodes our knowledge about these 
parameters before any data are taken. With a suitable 
basis of parameter space {xi} chosen, it is often tacitly 
assumed that the prior is flat - signifying our lack of 
information about this parameter in the absence of data. 
In other words, the prior probability of an interval Axi 
to contain the true value of the Xi is taken to be constant 
over the entire domain of deflnition of parameter space. 

However, while in some problems there is a naturally 
preferred basis of parameter space, this need not always 
be the case, and an alternative, equally well motivated 
parametrization {yi\ may exist. It is straightforward to 
show that generally, a prior in basis {x{\ does not corre- 
spond to the same prior in basis {j/i}. Labeling a prior A 
on {xi} hy TTx , the corresponding prior on {yi} is given 



by 
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where Vy = J d"y. Hence a flat, noninformative prior 
in one basis does not necessarily equal a noninformative 
prior in another, making the choice of basis equivalent 
to the choice of prior, and by consequence, extending 
its influence to the posterior and the inferred parameter 
constraints, unless the data become informative enough. 
This problem was identifled in [53l [Ml US] in the con- 
text of isocurvature models; here we will argue that in- 
flationary parameters, including estimates of the tensor- 
to-scalar ratio, can also be affected. 



which we shall refer to as post-sampling, is to take the 
converged chains of an analysis performed under prior 
(A), and multiply the weight of each point in the chain 
by the Jacobian as in Eq. (|6|. 

The advantage is that one can post-process readily 
available chains to present a new prior, which takes prac- 
tically no time. A possible drawback is that the chains, 
that converged for an analysis under prior A, may have 
too few (or no) points in the regions of parameter space 
important under prior B. 

The second option is explicit importance sampling of 
the second distribution, in which one, during the Monte- 
Carlo process, transforms the posterior of a point to re- 
flect the correct prior, by applying Eq. (|6l before the 
decision about acceptance of the point is taken. The 
advantage is that the convergence statistics will now be 
performed for the correct probability density, hence im- 
portant regions will have enough points in the chains. A 
drawback is that the analysis has to be performed from 
scratch, which can be time consuming. 



A. Importance sampling 



If from earlier analyses one knows that the bounds on 
parameters in set {xi} have Gaussian-like shapes, and the 
sets {xi} and {yt} are nonlinear ly related, one can expect 
that correlations between parameters in set {yi} are of 
nontrivial shape. In that case a Metropolis-Hastings al- 
gorithm, which is what we will use later on, will have 
difficulty exploring parameter space properly within an 
acceptable amount of time. A solution to this problem is 
importance sampling, which is the act of picking points 
according to one posterior distribution, but transforming 
the chance of accepting the point to another posterior 
distribution. In this way the algorithm walks through 
parameter space according to directions in the 'easier- 
to-explore' {xij-space, but performing the statistics as 
if working in {yi}-space. The resulting chains will be 
distributed according to the prior chosen in {yi}-space. 
Let A denote statistics with a ffat prior on {xi}, and 
let B denote statistics with a flat prior on {yi}. In the 
Metropolis-Hastings algorithm, the chance of accepting 
a proposed step is directly related to the ratio of its pos- 
terior and the posterior of the previous point. Hence a 
constant multiplicative factor in the posterior is irrele- 
vant, and we can neglect the volume term in Eq. Q. 
By consequence, any constant prior corresponds to a flat 
prior, such that the conversion to be done is 
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constant, 
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There are two distinct places in the analysis in which 
the correction for the prior can be applied. One option. 



B. Cosmological parameters 



When constraining the parameters of ACDM cos- 
mologies, it is a popula choice to take flat priors on 
{Q^ch^ ,Q.hh^ ,T,9} (the dark matter density, the baryon 
density, the optical depth to reionization, and the ratio 
of sound horizon to angular diameter distance at decou- 
pling, respectively) and for the primordial power spec- 
trum a flat prior on either {In Ag, ng, ag} (the amplitude, 
tilt, and running of the spectrum) or {In Ag, e^}, with {e^} 
some basis of slow-roll (SR) parameters. In the SR-basis 
of Hubble-flow parameters, the dynamics of inflation are 
hidden in these parameters by 
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where Ji\-ai — -ff*, '*' denotes evaluation at the pivot 
scale, and ' denotes derivation with respect to the fleld 
value of the inflaton. This means that in all these anal- 
yses the posterior distribution of the derived parameter 
-ffinf is obtained with a nonflat prior. 
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In Fig. 1 we show the Jacobian 
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FIG. 1: The Jacobian 



dyj 



for the coordinate transformation 



from set {xi} to set {yi}. It is clear that a flat prior on set 
{xi} strongly favors small values for H, compared to a flat 



prior on set {yi}, as ttI ' ~ tt^ 
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dyj I 



the coordinate transformation between sets 



FIG. 2: The marginalized posterior distributions of cold dark 
matter density {Qch'^), the baryon density (Q-t^h^), the ratio 
of sound horizon to angular diameter distance at decoupling 
(61) and the optical depth to reionization (r), under prior A 
(red dashed line), post-sampled from prior A to prior B (blue 
dotted line) and under prior B (black solid line). The post- 
sampled distributions are hardly visible as they practically 
completely agree with the importance sampled distributions. 
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corresponding up to a constant to the ratio iTy /ttx ■ A 
flat prior on set {j/i} (prior B) favors high values of H^, 
when compared to a flat prior on set {xi\ (prior A). 

Once the data come into play, the amplitude Ag will 
essentially be fixed. Since As ex i?^/e, higher values of 
Hif will need to be offset by higher values of e. In the 
slow-roll regime, e is related to the tensor-to-scalar ratio 
r by 



1 « 4m2 
16e — 



H' 



(12) 



and hence we can expect prior B to prefer a larger tensor 
contribution, compared to prior A. Equation (12 1 also 



shows that a flat prior on e roughly corresponds to a flat 
prior on r. 

III. FLAT PRIOR ON Hi^f 

In order to probe the scale of inflation, we numeri- 
cally integrate the perturbation equations of the infla- 
ton in a background described by a Taylor-expansion of 
H{(j>), as discussed in [inillSji and constrain the free pa- 
rameters using temperature and polarization data from 



the five year data release of the Wilkinson Microwave 
Anisotropy Probe (WMAP) satellite (WMAP5) [SB], as 
well as the power spectrum of luminous red galaxies 
from the Sloan Digital Sky Survey (SDSS-LRG) [57] . 
The parameter estimation is done using the Metropolis- 
Hastings algorithm, employing a modified version of 
the publicly available code CosmoMC [58 together with 
our own module for inflationary perturbations (which 
is available for download at jhttp : /7wwwlapp . in2p3 . | 
fr/~valkenbu/inf lationH/). The parameters describ- 
ing the model are either {fic^^i^b/i^,''', ^} + {xi} or 
{i^ch^,i^\jh^,T,6} + {ni}- We include the calculation of 
tensor perturbations. 

As a consequence of this exact numerical treatment 
of perturbations, we automatically impose a consistent 
inflationary prior (in the following this is referred to 
as "inflationary consistency"). By numerically integrat- 
ing the perturbation equations until the actual freeze- 
in of all modes, this method requires inflation to oc- 
cur over the observable range, which constrains param- 
eters more strongly than a naive application of the SR- 
approximation. As pointed out in Ref. [47^ , a naive im- 
plementation of the SR-approximation allows for incon- 
sistent models, for which small scale modes actually do 
not freeze in, even though the approximation provides 
a spectrum. We make no prior assumption on the to- 
tal length of inflation other than the length needed to 
produce the observed power spectrum of perturbations. 
That is, we remain conservative about the mechanism 
of inflation during the unobserved epoch, between hori- 
zon exit of the smallest observable modes and the end of 
inflation. 

As a consistency check we performed both described 
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FIG. 3: The marginalized posterior distributions of the pa- 
rameters describing the evolution of the Universe during in- 
flation, for the same analyses as in Fig. [2] Again the post- 
sampled distribution (blue dotted line) is hardly visible due to 
its good agreement with the importance sampled distribution 
(black solid line) . The main change under the transformation 



of priors is seen in the posterior of 
with the prediction in Fig. [T] 



rrip, in agreement 
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FIG. 4: The marginalized posterior distributions for the scale 
of inflation, Hint, and the scalar-to-tensor ratio, r, under 
prior A (red dashed line), under prior B (black solid lide), 
post-sampled to a Jeffreys prior on H^, (thin blue dotted line) 

and under a flat prior on {^s,lne, -jf^nip, "^a * mp} (green 

dashed, close to zero for both figures). Prior B corresponds to 
a flat prior on Hint, whereas prior A roughly corresponds to a 
flat prior on r, as explained in the text. Prior B pushes both 
i/inf and r up in value. Also shown is the mean likelihood 
over each (8 — 1) -dimensional parameter space for all values 
of H and r (dashed-dotted, magenta) . 



methods, post-sampling and importance sampling. In 
Fig. [2] we show the one dimensional marginalized pos- 
terior distributions of the four cosmological parameters 
describing the physics after inflation, comparing the anal- 
yses with prior A, the post-sampled chains from prior 
A to prior B, and the chains with prior B (importance 
sampled). The post-sampled and importance sampled 
analyses completely agree, which shows that the chains 
that converged under prior A have enough samples in the 
typical set of the posterior distribution under prior B. As 
should be expected, the four parameters shown in Fig. [2] 
are not affected by the change in prior. In Fig.lSlwe show 




FIG. 5: Two dimensional marginalized posterior distribu- 
tions for two illustrative cases, comparing prior A (red dashed 
line), prior B (black solid) and a noninflationary analy- 
sis, probing the four cosmological parameters plus the set 
{In j4s,ns, as,'"} describing the primordial spectrum (blue 
dotted line). All inner contours correspond to 68% CL 
bounds, all outer contours correspond to 95% CL bounds. 
Left: the curved correlation shape between d^Hi^f and Hi^{ 
illustrates the need for importance sampling when taking a 
flat prior on these parameters. Right: both r and ns are 
pushed up by prior B. 



the posterior distributions of parameters describing the 
inflationary evolution. The main change is in the pos- 
terior of the parameter I ^ ) rrip , which has a higher 

preferred value under prior B than under prior A. This 
result is in agreement with the expected effect, illustrated 
in Fig. [1] 

The scale of inflation and the tensor-to-scalar ratio are 
shown in Fig. |4] Both parameters, which are related, 
have a higher preferred value under prior B. 

For illustration, in Fig.|4] we also show a post-sampled 
distribution with a flat prior on {In H^, H'^, H'J ,H'J'}. 
Also this prior gives a higher value for H.^, and r than 
prior A does. Note that under both prior A and prior 
B, we find a lower r than the combined analysis of the 
WMAP three year data and SDSS-LRG in Ref. [57], 
which is due to our inflationary prior: integrating the 
modes until actual freeze-in, means demanding inflation 
for about 22 e-folds, which forces the inflaton potential 
to be relatively smooth. The smoothness of the potential 
pushes e down and thereby also r. Likewise the scalar tilt 
TT-s is pushed toward unity, as is shown in Fig. [5] where 
the two dimensional parameter correlations are shown for 
two illustrative cases, if inf versus d^H, and r versus ng. 
The former illustrates the nonlinear correlation between 
the parameters Hini and d^H in the data. The curved 
shape of the posterior probability contour indicates that 
it would take a Metropolis-Hastings sampler a long time 
to random- walk from one lobe to another if steps are only 
to be taken in either horizontal or vertical direction, or a 
linear combination of both, in the plane of this plot. By 
using importance sampling, steps are taken in correlated 
directions, significantly speeding up the process. The lat- 
ter shows both the effect of the inflationary prior, present 
in both analyses, and the effect of going from prior A to 
prior B. In both analyses the value of e is relatively close 
to zero, however it is larger under prior B. 



It is interesting to note that we also find an apparent 
lower bound on the scale of inflation, even for a flat prior 
on H^. In fact, this phenomenon is related to our choice 
of prior on e (or H'^, under prior B). Let us illustrate the 
effect in the example of prior A. The dislike of the data 
for a large tensor contribution leads to an upper bound 
on e due to Eq. (12 1. In order to reproduce the observed 
amplitude of fluctuations, As ~ ^*/e implies also an 
upper bound on H^. However, as a consequence of the 
flat prior on e, extremely small values of e, while cer- 
tainly allowed by the data, are assigned an exponentially 
suppressed probability, with a preference for e of the or- 
der of magnitude of its upper bound. Since ^s '^ H^l^i 
we also have a suppression of small values of i?*, with 
a peak slightly below the upper bound. If we instead 
take the prior to be flat on the logarithm of e (i.e., a 
Jeffreys prior on e), we do not see such a suppression. 
However, the results for the Jeffreys prior on e must be 
interpreted with care as they are highly dependent on 
the lower bound. For numerical reasons we took a lower 
bound of In e > —57. Had we taken an even smaller lower 
bound, the lines would be even closer to zero. A similar 
result can be anticipated for a flat prior on In H'^ in the 
{Hi} parametrization. That this dependence on the lower 
bound does not occur under the Jeffreys prior on H but 
a flat prior on H' is explained by the same reasoning as 
the apparent lower bound on H . 

In addition to the posteriors. Fig. [4] shows the mean 
likelihood over each (8 — 1) -dimensional parameter space 
for all values of H and r. This is a prior-independent 
quantity with no probabilistic information (i.e., it is not 
a probability density). It serves as an approximation for 
the profile likelihood. The profile likelihood is the best 
fit that can be achieved given a certain parameter value. 
The discrepancy between the mean likelihood and the 
various posteriors indicates that the various lower bounds 
in the posteriors are results of either volume effects in 
the process of marginalization, the choice of prior, or a 
combination of both. The mean likelihood shows that a 
good fit can even be achieved for very small values of r 
and ilinf. In fact the best fit we found lies at r = 4 x 10"^ 
and -ffinf = 4 X IQ^^mp. This certainly does not coincide 
with the peaks of the posteriors found for priors A and 
B. 

In Fig. [5] we also compare derived parameters from 
the different analyses with the bounds obtained on these 
parameters when using no inflationary prior and simply 
fitting a primordial power spectrum. 



Prior A 
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and a consistent tensor spectrum, described by r and con- 
sistency relations between the tensor spectral tilt (tt-t) 
and the scalar parameters, to the same data. Calculat- 
ing {7?inf,i?i'„f} from {As,ns,as,r} is done using the 
relations given in Ref . |45] . The curved shape of the cor- 
relation between H^^^f and iJi^f reflects the need for im- 
portance sampling. The 95% confidence level (CL) con- 



Prior B 



H'"H' 4 -, * 

„2 ' mp} 



Cb 6.98 ± 0.03 7.80 ± 0.03 7.76 ± 0.06 6.25 ± 0.8 

TABLE I: Bayesian complexity for different choices of prior. 
This number should be compared to the number of free pa- 
rameters, which is eight for all models considered here. 



tours under prior A correspond to the 95% CL contours 
from the spectral fit for small values of H[^f, whereas the 
95% CL contours under prior B correspond to the spec- 
tral fit for large values of -ffj'„f . An important conclusion 
to draw here is that both priors A and B allow most of 
parameter space that is allowed by the spectral fit, which 
has no inflationary prior. 

In the ng-r-plane, prior A clearly pushes r down with 
respect to merely performing a spectral fit because of the 
demand that infiation lasts long enough to produce the 
full observed spectrum under a fiat prior on virtually the 
same parameters, whereas prior B pushes r up, in spite 
of the same condition on the duration of infiation. 

In the absence of clearly favored theoretical models, 
the beauty of various priors is, alas, largely in the eye of 
the beholder. Nevertheless we emphasize that both priors 
A and B do not exceed the spectral limits but do probe 
practically the whole range allowed without the require- 
ment of persistent inflation. More interestingly, the 68% 
CL contour for prior B actually yields a nonzero lower 
bound for r. Marginalized over all parameters, the pos- 
terior of r gives, at 68% CL, 0.061 < r < 0.243, however 
at 95% CL r is still consistent with zero. While this may 
hint at a nonzero amplitude of tensor modes, our analy- 
sis underscores the prior model dependence of this result 
and thus we do not put much stake in it here. Neverthe- 
less, it does suggest that future polarization searches for 
tensor modes may have a better chance of detection than 
otherwise suggested. 



Bayesian Complexity 

When selecting models and priors, a quantity that 
can distinguish between models is the Bayesian evidence, 
which rewards both the predictivity and the conciseness 
of a model, and gives preference to the model with the 
best balance between the two characteristics. When the 
Bayesian evidence cannot distinguish between two mod- 
els, a secondary quantity to make the comparison is the 
Bayesian complexity [59 1 160 ] . 



Ci, = x^-x\d), 



(14) 



where the effective x^ is defined as — 21n£, with the like- 
lihood £., and 6 denotes the best fit point, and the over- 
line denotes the mean over the posterior. The Bayesian 
complexity measures the information gain when going 
from the prior to the posterior, and can be interpreted 



as a measure of the number of parameters the data can 
constrain in a model, or conversely the number of param- 
eters a model effectively needs to fit the data. Along the 
same lines, if the Bayesian complexity is smaller than the 
actual number of free parameters of a model, this could 
be taken as a sign that the model contains "unnecessary" 
degrees of freedom, i.e., parameters on which we do not 
gain information from the data. 

Note that the Bayesian complexity itself contains no 
information about the goodness of fit of a model, or the 
evidence of one model over another, but gives an addi- 
tional measure on the number of parameters in a model 
that is justified by the data. Without an evidence cal- 
culation, the complexity can still be useful for telling 
whether parameters are mostly bounded by either the 
data or the prior. In this work we are interested in the 
question which priors (with the same underlying model) 
constrain parameters beyond the constraining power of 
the data, and which priors allow the data to give infor- 
mation on parameters. 

The Bayesian evidence cannot be reliably calculated 
from the Markov-chain Monte Carlo (MCMC) chains ob- 
tained doing the parameter estimation, as these chains 
have a lack of information on the tails of the parameter 
distributions. An elaborate analysis would be necessary, 
e.g. using nested sampling [BT]. The Bayesian complex- 
ity, however, can be readily calculated from the chains. 
In Table |l] we show the Bayesian complexity for the same 
model under the different priors. 

For prior A we find a complexity of 6.98 ± 0.03, for 
eight free parameters. This indicates that the data do 
not give any information on one of the free parameters. 
Most likely this is due to -^jjr^mp which is more tightly 
constrained by imposing infiationary consistency than by 
the data, as explained in Ref. [47 . Compared to prior 
A, prior B has more volume in regions constrained by 
the data, increasing the amount of information gained 
and pushing up the complexity to 7.80 ±0.03. Compared 
to that, in the {As,ln€, -jY^irip, ^2 ' rrip} basis the op- 
posite happens, as e is pushed much closer to zero, such 
that the data give no new information on this parameter, 
decreasing the complexity. For the "phenomenological" 
parameter set {As,ns,as,r} with fiat priors, no infia- 
tionary consistency is imposed. Therefore, in this basis, 
as has no theoretical prior constraints and can be con- 
strained by the data. 

An increase in the complexities under prior B and the 
phenomenological prior compared to prior A should not 
be taken to mean that these prior choices are superior. 
Indeed, in all cases the complexity value is less than 8, 
which is the number of infiationary parameters under 
consideration. Rather, we take this as an indication that 
the data is highly sensitive to the choice of parametriza- 
tion of infiationary models, in particular the choice of 
prior distribution for r, and hence the posterior proba- 
bility densities refiect far more the choice of volume of 
prior parameter space than the impact of the data. 



IV. CONCLUSION 

Our paper makes explicit an important and well-known 
fact regarding the effort to constrain cosmological param- 
eters: the importance of prior assumptions in the analysis 
must not be neglected. We have demonstrated in a vari- 
ety of ways that this situation is relevant to the current 
issue of a possible nonzero value of r and expectations 
for future CMB missions. In the absence of clear theo- 
retical direction, it is important therefore to consider the 
divergence of results obtained by presumably equally well 
motivated priors. We have demonstrated here how to re- 
late fiat priors on different parametrizations of the same 
physics, and applied a change of paramctrization to the 
reconstruction of the infiaton potential, choosing a fiat 
prior on the parameters that may be better motivated by 
the physics of infiation, as opposed to parameters describ- 
ing the observable quantities. The main change, seen in 
Figs. I4] and [5] is an increase of the preferred value for 
the tensor-to-scalar ratio r, moving from < r < 0.18 to 
0.061 < r < 0.243 at 68% CL We stress once again that 
this new preferred range does not imply that the data 
now prefer a nonzero value of r, since at 95% CL r is 
consistent with zero under all used priors. The fact that 
for certain choices of paramctrization the complexity is 
less than the number of free parameters, eight, indicates 
that the data is currently insufficient to fully constrain 
the models. Rather, our calculation of the complexity 
shows that for prior B, which gives the increased range 
in r, the data are simply sensitive to more of the param- 
eter volume. Thus we consider the mean likelihood to 
be a more meaningful quantity here. In particular infor- 
mation on the parameters r and -ffinf under prior B is 
primarily gained on the upper bound. 

As a result we emphasize that the mean likelihood for 
the parameters we considered gives an indication of nei- 
ther a nonzero scale of infiation nor a nonzero tensor- 
to-scalar ratio. Nevertheless, the fact that one plausible 
paramctrization of the data increases the posterior prob- 
ability of these quantities to be nonzero, suggests that 
from a Bayesian point of view the motivation for probing 
for tensor modes may be slightly enhanced as a result of 
our analysis. 
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